Recent Advances in Computational Linguistics
نویسندگان
چکیده
ive summarization approaches use information extraction, ontological information, information fusion, and compression. Automatically generated abstracts (abstractive summaries) moves the summarization field from the use of purely extractive methods to the generation of abstracts that contain sentences not found in any of the input documents and can synthesize information across sources. An abstract contains at least some sentences (or phrases) that do not exist in the original document. Of course, true abstraction involves taking the process one step further. Abstraction involves recognizing that a set of extracted passages together constitute something new, something that is not explicitly mentioned in the source, and then replacing them in the summary with the new concepts. The requirement that the new material not be in the text explicitly means that the system must have access to external information of some kind, such as an ontology or a knowledge base, and be able to perform combinatory
منابع مشابه
Automatic Short Answer Marking
Our aim is to investigate computational linguistics (CL) techniques in marking short free text responses automatically. Successful automatic marking of free text answers would seem to presuppose an advanced level of performance in automated natural language understanding. However, recent advances in CL techniques have opened up the possibility of being able to automate the marking of free text ...
متن کاملPredicting the N400 Component in Manipulated and Unchanged Texts with a Semantic Probability Model
Within the field of computational linguistics, recent research has made successful advances in integrating word space models with n-gram models. This is of particular interest when a model that encapsulates both semantic and syntactic information is desirable. A potential application for this can be found in the field of psycholinguistics, where the neural response N400 has been found to occur ...
متن کاملBridging language with the rest of cognition: computational, algorithmic and neurobiological issues and methods
The computational program for theoretical neuroscience initiated by Marr and Poggio (1977) calls for a study of biological information processing on several distinct levels of abstraction. At each of these levels — computational (defining the problems and considering possible solutions), algorithmic (specifying the sequence of operations leading to a solution) and implementational — significant...
متن کاملProceedings of the 17 th Meeting of Computational Linguistics in the Netherlands ( CLIN 17 )
I will review recent advances in grammar-based sentence realization from logical-form meaning representations. The LOGON MT prototype aims at the fully-automated, highquality translation of Norwegian instructional texts (on backcountry activities) into English. The LOGON generator operates off underspecified meaning representations derived from ‘deep’ grammatical analysis (in the LFG framework)...
متن کاملAutomatic Translation of Languages Since 1960: A Linguist's View
s of papers presented at the Second (1964), Fourth (1966), Fifth (1967),Sixth (1968) Annual Meetings of the Association for Computational Linguis-tics (formerly the Association for Machine Translation and ComputationalLinguistics) available at the Slavic Department, Wayne State University,Detroit, Michigan 48202.Edmundson, H.P. ed., Proc. Nat. Symp. Machine Translation, Los ...
متن کاملProducing a Persian Text Tokenizer Corpus Focusing on Its Computational Linguistics Considerations
The main task of the tokenization is to divide the sentences of the text into its constituent units and remove punctuation marks (dots, commas, etc.). Each unit is a continuous lexical or grammatical writing chain that is an independent semantic unit. Tokenization occurs at the word level and the extracted units can be used as input to other components such as stemmer. The requirement to create...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Informatica (Slovenia)
دوره 34 شماره
صفحات -
تاریخ انتشار 2010